Abstract

We solve the conjecture of Fill, Scheinerman and Singer-Cohen posed in [1] and
show the equivalence of the sharp threshold functions of the random intersection graph
$\mathcal{G}(n,m,p)$ with $m \ge n^3$ and a graph $G(n,\hat{p})$ with independent edges. Moreover, we
prove sharper equivalence results under some additional assumptions.
1 Introduction

In an intersection graph there is a set of vertices $V$ and an auxiliary set of objects $W$. Each
vertex $v \in V$ is assigned a subset of objects $W(v) \subseteq W$ according to a given probability
measure. Two vertices $v_1, v_2$ are adjacent in a random intersection graph if and only if
$W(v_1) \cap W(v_2) \neq \emptyset$. A general model of the random intersection graph, in which each vertex
is assigned a subset of objects $W(v) \subseteq W$ chosen uniformly from all $d$-element subsets, where
the cardinality $d$ is determined according to an arbitrarily given probability distribution,
was introduced in [5].

We will concentrate on analyzing the properties of the random intersection graph in
which the cardinality $d$ is chosen according to the binomial distribution. Namely, we will
investigate properties of the random intersection graph $\mathcal{G}(n,m,p)$ introduced in [TU].
$\mathcal{G}(n,m,p)$ is a graph with number of vertices $|V| = n$ and number of objects $|W| = m$, in which
each object $w$ is added to $W(v)$ with probability $p$ independently for all $v \in V$ and $w \in W$
(i.e. $\Pr\{w \in W(v)\} = p$). However, to some extent, the results obtained may be generalized
to other random intersection graph models due to equivalence theorems proved in Section 4 
in0. 
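
The definition of $\mathcal{G}(n,m,p)$ can be made concrete with a short simulation. The following is a minimal sketch, not part of the original argument; the function name `sample_intersection_graph` and the labelling of vertices and objects by integers are assumptions made only for illustration.

```python
import itertools
import random


def sample_intersection_graph(n, m, p, seed=None):
    """Sample the object sets W(v) and the edge set of G(n, m, p)."""
    rng = random.Random(seed)
    # Each object w in {0, ..., m-1} joins W(v) independently with probability p.
    W = [{w for w in range(m) if rng.random() < p} for _ in range(n)]
    # Vertices u and v are adjacent iff W(u) and W(v) have a common object.
    edges = {
        (u, v)
        for u, v in itertools.combinations(range(n), 2)
        if W[u] & W[v]
    }
    return W, edges
```

The sketch costs on the order of $nm + n^2$ operations, so it is only suitable for small experiments.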

When introducing a new random graph model, it is worth asking how it differs from those
already studied. Therefore one of the first papers on the topic of random intersection
graphs was the one by Scheinerman, Fill and Singer-Cohen [4]. The aim of that paper was
to compare $\mathcal{G}(n,m,p)$ with the random graph $G(n,\hat{p})$ in which each edge appears independently
with probability $\hat{p}$. The value of $\hat{p}$ was set to be approximately $\Pr\{(v_1,v_2) \in E(\mathcal{G}(n,m,p))\}$.
One of the conclusions of the article was that the differences between $G(n,\hat{p})$ and $\mathcal{G}(n,m,p)$
are caused by the dependency of edge appearance in the latter, and that the larger $m$ is, the less
relevant the dependency.

The main theorem in [4] states that for $m = n^{\alpha}$ with $\alpha > 6$ the graphs $G(n,\hat{p})$ and $\mathcal{G}(n,m,p)$ have
asymptotically the same properties. Moreover, it is pointed out that the theorem may be
extended to smaller values of $\alpha$ if we make additional assumptions about $p$. The proof is
based on the fact that for large $\alpha$ and relevant values of $p$, with probability tending to one
as $n \to \infty$, there are no properties assigned to more than two vertices and therefore the
dependency between edges is asymptotically negligible. The authors of [4] suggest that the
equivalence theorem is true for all properties for $3 < \alpha < 6$, i.e. in the case where the
number of vertices assigned to each property is still small.

The above-mentioned result and conjecture are consistent with the simple observation
that the number of vertices to which a given property $w$ is assigned has an essential impact on the
dependency between edge appearances in $\mathcal{G}(n,m,p)$. The edge set of the random intersection
graph $\mathcal{G}(n,m,p)$ is a union of cliques with the vertex sets $V(w) := \{v \in V : w \in W(v)\}$,
$w \in W$, and we may divide the set of edges of $\mathcal{G}(n,m,p)$ according to the size of the clique
in which each edge is contained. Let $k \ge 2$. We will denote by $\mathcal{G}_k(n,m,p)$ a graph with the vertex set
$V$ and the edge set $\{(v_1,v_2) : \exists_{w}\ v_1,v_2 \in V(w) \text{ and } |V(w)| = k\}$. Alternatively we may define
$\mathcal{G}_k(n,m,p) = G\mathcal{H}_k(n,m,p)$, where $\mathcal{H}_k(n,m,p)$ is a hypergraph with the vertex set $V$ and
the edge set $\{(v_1,v_2,\dots,v_k) : \exists_{w}\ V(w) = \{v_1,v_2,\dots,v_k\}\}$ and, for a hypergraph $\mathcal{H}$,
$G\mathcal{H}$ is the graph with the same vertex set as $\mathcal{H}$ and an edge set consisting of
those pairs of vertices which are contained in at least one edge of $\mathcal{H}$. Under this notation
$E(\mathcal{G}(n,m,p)) = \bigcup_{k=2}^{n} E(G\mathcal{H}_k(n,m,p)) = \bigcup_{k=2}^{n} E(\mathcal{G}_k(n,m,p))$. In [4] it is shown that for
some $m$ and $p$ the graphs $\mathcal{G}(n,m,p)$, $\mathcal{G}_2(n,m,p)$ and $G(n,\hat{p})$ are asymptotically almost the same.
To be precise, $\mathcal{G}_k(n,m,p)$ are empty for $k \ge 3$ with probability tending to one as $n \to \infty$
(we will say with high probability) and the edges in $\mathcal{G}_2(n,m,p)$ are almost independent.
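
The decomposition of the edge set by clique size can be sketched in code. The snippet below is an illustrative assumption-laden helper (the name `clique_decomposition` and the input format, a list of sets `W[v]`, are not from the paper): it groups the objects by $|V(w)|$ and returns the edge set of each $\mathcal{G}_k$.

```python
from itertools import combinations


def clique_decomposition(W):
    """Map k -> edge set of G_k, given W[v] = set of objects of vertex v."""
    # V_of[w] is the clique V(w) = {v : w in W(v)}.
    V_of = {}
    for v, objects in enumerate(W):
        for w in objects:
            V_of.setdefault(w, set()).add(v)
    edges_by_size = {}
    for vertices in V_of.values():
        k = len(vertices)
        if k >= 2:  # objects held by fewer than two vertices create no edge
            edges_by_size.setdefault(k, set()).update(
                combinations(sorted(vertices), 2)
            )
    return edges_by_size
```

The union of the returned edge sets over all $k$ is the edge set of the intersection graph itself.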

The authors of [3] support the conjecture for $3 < \alpha < 6$ by results concerning threshold
functions for some properties of $\mathcal{G}(n,m,p)$. However, it should be pointed out that if there
exists $C > 0$ such that

(1) $p \ge \dfrac{C}{n\, m^{1/3}}$,

then the expected number of edges in $\mathcal{G}_3(n,m,p)$ tends to a constant or even to infinity.
Therefore we may expect that the structures of $\mathcal{G}(n,m,p)$ and $G(n,\hat{p})$ differ. Namely,
though the triangles in $\mathcal{G}_2(n,m,p)$ may make the dominating contribution, the
impact of triangles contained in $\mathcal{G}_3(n,m,p)$ on the structure of the random intersection graph
cannot be neglected. As an example we may state the fact that for $\alpha = 3$ the number of
triangles in $\mathcal{G}(n,m,p)$ and $G(n,\hat{p})$ on the threshold of appearance (i.e. for $p = c/n^2$ and
$\hat{p} \sim mp^2 = c^2/n$) has Poisson distribution with parameters $(c^3 + c^6)/3!$ and $c^6/3!$, respectively
(see [11]). For larger values of $\alpha$ the expected number of triangles in $\mathcal{G}(n,m,p)$ and $G(n,\hat{p})$
may also differ significantly. The same is true for cliques of size four contained in $\mathcal{G}_4(n,m,p)$.
In fact $\mathcal{G}_k(n,m,p)$ should rather be compared with $GH_k(n,\hat{p}_k)$, where $\hat{p}_k$ is approximately
the probability that for given $\{v_1,\dots,v_k\} \subseteq V$ there exists $w$ such that $V(w) = \{v_1,\dots,v_k\}$,
$H_k(n,\hat{p}_k)$ is a $k$-uniform random hypergraph with each edge appearing independently with
probability $\hat{p}_k$ and $GH_k(n,\hat{p}_k)$ is defined as above. The above observation leads us to the
conclusion that the equivalence theorem may not be stated for $3 < \alpha < 6$ in such a general
form as it was for $\alpha > 6$. Therefore we will turn our attention to the case of monotone
properties. The concept of restricting the equivalence theorems to the class of monotone
properties has already been developed while examining the equivalence of $G(n,\hat{p})$ and
G(n,M) (see ME]). 
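
The two Poisson parameters quoted above for $\alpha = 3$ can be sanity-checked at the level of first moments. The sketch below is only an illustration under stated assumptions: it takes $m = n^3$, $p = c/n^2$, uses $1-\exp(-mp^2(1-p)^{n-2})$ as the edge probability of the independent model, and counts 3-element sets $V(w)$ for the extra triangles; the function name `triangle_means` is hypothetical. It checks expectations only, not the Poisson limit itself.

```python
from math import comb, expm1


def triangle_means(n, c):
    """First moments for m = n^3 and p = c / n^2.

    Returns (expected triangles in the independent-edge graph,
             expected number of 3-element vertex sets equal to some V(w)).
    """
    m = n ** 3
    p = c / n ** 2
    # Edge probability of the independent model: 1 - exp(-m p^2 (1-p)^(n-2)).
    p_hat = -expm1(-m * p ** 2 * (1 - p) ** (n - 2))
    mean_independent = comb(n, 3) * p_hat ** 3
    # Probability that a fixed triple equals V(w) for at least one object w.
    p_hat_3 = -expm1(-m * p ** 3 * (1 - p) ** (n - 3))
    mean_size_three_cliques = comb(n, 3) * p_hat_3
    return mean_independent, mean_size_three_cliques
```

For large $n$ the two values approach $c^6/3!$ and $c^3/3!$, so their sum approaches the parameter $(c^3+c^6)/3!$.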

The article is organized as follows. In Section 2 we state and discuss the results. In
Section 4 we outline the proof of the main theorems. In Sections 5 and 6 we prove results
needed to complete the proof.

Throughout the article all limits are taken as $n \to \infty$. We will also use the standard Landau
notation $O(\cdot)$, $\Theta(\cdot)$, $\Omega(\cdot)$, $o(\cdot)$, $\sim$ (see for example [6]) and we will use the phrase 'with high
probability' to mean with probability tending to one as $n \to \infty$.



2 Result

In our considerations we will draw our attention to $\mathcal{G}(n,m,p)$ for

(2) $\Omega\left(\dfrac{1}{n\, m^{1/3}}\right) = p = O\left(\sqrt{\dfrac{\ln n}{m}}\right)$.

For values of $p$ significantly larger than $\sqrt{\ln n / m}$ the graph $\mathcal{G}(n,m,p)$ is with high probability
a complete graph on $n$ vertices (see [4, 10]). And if

(3) $p = o\left(\dfrac{1}{n\, m^{1/3}}\right)$,

then with high probability $\mathcal{G}_k(n,m,p)$ are empty for all $k \ge 3$. Therefore a slight modification
of the proof from [4] gives us the result that $\mathcal{G}(n,m,p)$ and $G(n,\hat{p})$ are asymptotically
equivalent for all possible properties.

In fact we may state the following equivalence theorem.

Theorem 1. Let $a \in [0;1]$, $\mathcal{A}$ be any graph property, $p = o\left(\frac{1}{n\, m^{1/3}}\right)$ and

$\hat{p} = 1 - \exp\left(-mp^2(1-p)^{n-2}\right)$.

Then

$\Pr\{G(n,\hat{p}) \in \mathcal{A}\} \to a$

if and only if

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \to a$.
For the proof see [1] and Lemma 
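
To get a feeling for the edge probability used in Theorem 1, one can compare it numerically with the exact probability $1-(1-p^2)^m$ that two fixed vertices share at least one object. The following is a minimal sketch; the helper names `edge_probability` and `p_hat` are assumptions introduced here, and both functions require $0 \le p < 1$.

```python
from math import expm1, log1p


def edge_probability(m, p):
    """Exact Pr{W(v1) and W(v2) intersect} = 1 - (1 - p^2)^m, for 0 <= p < 1."""
    return -expm1(m * log1p(-p * p))


def p_hat(n, m, p):
    """The value 1 - exp(-m p^2 (1-p)^(n-2)) appearing in Theorem 1."""
    return -expm1(-m * p * p * (1.0 - p) ** (n - 2))
```

For example, with $n = 1000$, $m = n^3$ and $p = 5\cdot 10^{-7}$ the two values agree up to a relative error well below one percent.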

The result will concern monotone properties. The most important properties, such as
connectivity, having the largest component of size at least $k$, containing a perfect
matching or containing a given graph as a subgraph, are included in the wide family
of monotone properties. Let $\mathbb{G}$ be a family of graphs with a vertex set $V$. We will call
$\mathcal{A} \subseteq \mathbb{G}$ an increasing (decreasing) property if $\mathcal{A}$ is closed under isomorphism and $G \in \mathcal{A}$
implies $G' \in \mathcal{A}$ for all $G'$ such that $E(G) \subseteq E(G')$ ($E(G') \subseteq E(G)$).
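
For small vertex sets the definition of an increasing property can be verified by brute force. The sketch below is an illustration only (the helpers `is_connected` and `is_increasing` are hypothetical names): it enumerates all graphs on $n$ labelled vertices and checks that adding a single edge never destroys the property, which by induction is enough for closure under adding any set of edges.

```python
from itertools import combinations


def is_connected(n, edges):
    """Connectivity of a graph on vertices 0..n-1 given as a set of pairs."""
    adjacency = {v: set() for v in range(n)}
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)
    seen, stack = {0}, [0]
    while stack:
        for y in adjacency[stack.pop()]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == n


def is_increasing(n, prop):
    """Check that prop(n, edges) is preserved when any single edge is added."""
    pairs = list(combinations(range(n), 2))
    for mask in range(1 << len(pairs)):
        edges = frozenset(e for i, e in enumerate(pairs) if (mask >> i) & 1)
        if prop(n, edges):
            for e in pairs:
                if e not in edges and not prop(n, edges | {e}):
                    return False
    return True
```

The check is exponential in the number of vertex pairs, so it is meant for $n \le 5$ or so.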
Theorem 2. Let $a \in [0;1]$, $m = n^{\alpha}$ for $\alpha \ge 3$ and $\mathcal{A}$ be any monotone property.
(i) Let

$p = O\left(\sqrt{\dfrac{\ln n}{m}}\right)$

and $1/(n\, m^{1/3}) = o(p)$ for $\alpha = 3$. If

$\Pr\{G(n, 1 - \exp(-mp^2(1-p)^{n-2})) \in \mathcal{A}\} \to a$

and for all $\varepsilon = \varepsilon(n) \to 0$

$\Pr\{G(n, (1+\varepsilon)(1 - \exp(-mp^2(1-p)^{n-2}))) \in \mathcal{A}\} \to a$,

then

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \to a$.

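(ii) Let $\hat{p} = \hat{p}_n \in [0;1)$ be a sequence bounded away from one by a constant.
If for all $\varepsilon = \varepsilon(n) \to 0$

$\Pr\left\{\mathcal{G}\left(n, m, \sqrt{\dfrac{-\ln(1-\hat{p})}{(1+\varepsilon)m}}\right) \in \mathcal{A}\right\} \to a$

and

$\Pr\left\{\mathcal{G}\left(n, m, \sqrt{\dfrac{-\ln(1-\hat{p})}{(1-\varepsilon)m}}\right) \in \mathcal{A}\right\} \to a$,

then

$\Pr\{G(n,\hat{p}) \in \mathcal{A}\} \to a$.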

In (i) we have to exclude the case $p = \Theta(1/(n\, m^{1/3}))$, since the claim is not true at the
threshold of triangle appearance (see [11]).

The method of the proof is strong enough to show sharper results in many cases. For
example, for $\alpha > 3$ the function $\varepsilon(n)$ may be replaced by $1/n^{\delta}$, where $\delta$ is a constant
depending on $\alpha$. We state here two theorems as examples of how tight the results may
be if we make some additional assumptions.



Theorem 3. Let $a \in [0;1]$, $\mathcal{A}$ be any monotone property, $m = n^{\alpha}$ for $\alpha > 4$ and

$\Omega\left(\dfrac{1}{n\, m^{1/3}}\right) = p = O\left(\sqrt{\dfrac{\ln n}{m}}\right)$.

Let

$\hat{p}_- = 1 - \exp(-mp^2(1-p)^{n-2})$;

$\hat{p}_+ = 1 - \exp(-mp^2(1-p)^{n-2}) + 10\sqrt[3]{mp^3}$.

If

$\Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} \to a$ and $\Pr\{G(n,\hat{p}_+) \in \mathcal{A}\} \to a$,

then

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \to a$.

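Theorem 4. Let $a \in [0;1]$, $c_3 = 30$, $c_4 = 157$, $\mathcal{A}$ be any monotone property, $m = n^{\alpha}$ for
$\alpha > 10/3$. Let

$\hat{p}_- = 1 - \exp(-mp^2(1-p)^{n-2})$;

$\hat{p}_+ = \begin{cases} 1 - \exp(-mp^2(1-p)^{n-2}) + c_3\sqrt[3]{mp^3}, & \text{for } \Omega(n^{-1}m^{-1/3}) = p = O(n^{-1}m^{-1/4}); \\ 1 - \exp(-mp^2(1-p)^{n-2}) + c_3\sqrt[3]{mp^3} + c_4\sqrt[6]{mp^4}, & \text{for } \Omega(n^{-1}m^{-1/4}) = p = O(m^{-1/2}\ln^{1/2} n). \end{cases}$

If

$\Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} \to a$ and $\Pr\{G(n,\hat{p}_+) \in \mathcal{A}\} \to a$,

then

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \to a$.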



3 Auxiliary definitions, inequalities and facts
3.1 Coupling

We will frequently use a coupling argument. Let $(\mathbb{P}, \preceq)$ be a countable partially ordered
set. In our case $\mathbb{P}$ may be a subset of $\mathbb{N}$ with the relation $\le$, a Cartesian product $\mathbb{N}^t$ with the relation
$(x_1,\dots,x_t) \preceq (y_1,\dots,y_t) \Leftrightarrow \forall_{1\le i\le t}\ x_i \le y_i$, or a set of hypergraphs $\mathbb{G}$ on a given set of vertices
with the relation $\subseteq$ of being a subhypergraph. In the article the set $\mathbb{G}$ will be either the set of
all graphs or hypergraphs on $n$ vertices or the set of $k$-partite graphs or hypergraphs with
partition classes of $n$ vertices each. To avoid unnecessary formalities we will not say explicitly which
partially ordered set we are considering when it is obvious from the context. Let $X$ and $Y$
be two random variables with values in $\mathbb{P}$. We will write

$X \preceq_q Y$

if there exists a coupling $(X, Y)$ of the random variables such that $X \preceq Y$ with probability
$q$ (i.e. if there exists a probability space $\Omega$ and two random variables $X'$ and $Y'$ such that
$X'$ and $Y'$ are both defined on $\Omega$, have the same probability distributions as $X$ and $Y$, respectively,
and $X' \preceq Y'$ with probability $q$). We will use the fact that such a coupling exists if and only
if there exists a probability measure $\mu : \mathbb{P}\times\mathbb{P} \to [0;1]$ such that for any set $A \subseteq \mathbb{P}$ we have
$\mu(A \times \mathbb{P}) = \Pr\{X \in A\}$, $\mu(\mathbb{P} \times A) = \Pr\{Y \in A\}$ and $\mu(\{(x,y) \in \mathbb{P}\times\mathbb{P} : x \preceq y\}) = q$.
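
As a simple illustration of the notion, the classical monotone coupling of two independent-edge graphs with edge probabilities $r_1 \le r_2$ realises $G(n,r_1) \preceq_1 G(n,r_2)$. The sketch below is a standard textbook construction, not one of the couplings built in this article; the function name `coupled_gnp` is an assumption.

```python
import random
from itertools import combinations


def coupled_gnp(n, r_low, r_high, seed=None):
    """Sample (G_low, G_high) on a common probability space.

    Marginally G_low ~ G(n, r_low) and G_high ~ G(n, r_high); if
    r_low <= r_high then G_low is a subgraph of G_high with probability 1.
    """
    rng = random.Random(seed)
    low, high = set(), set()
    for e in combinations(range(n), 2):
        u = rng.random()  # one shared uniform variable per potential edge
        if u < r_low:
            low.add(e)
        if u < r_high:
            high.add(e)
    return low, high
```

Sharing the uniform variable is what puts both graphs on one probability space; the marginal distributions are unaffected.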

Now we will state two useful facts. We add the simple proofs for completeness.

Fact 1. Let $\mathbb{P}$ be a countable partially ordered set and $X$, $Y$ and $Z$ be random variables with
values in $\mathbb{P}$. If

(4) $X \preceq_{1-q_1} Y$ and $Y \preceq_{1-q_2} Z$,

then for some $q \le q_1 + q_2$

$X \preceq_{1-q} Z$.

Proof. Let $\mu_1, \mu_2 : \mathbb{P}\times\mathbb{P} \to [0;1]$ be probability measures associated with the couplings existing
by (4). Let $\mathbb{P}^* = \{y \in \mathbb{P} : \Pr\{Y = y\} \neq 0\}$. Define

$\mu_3 : \mathbb{P}\times\mathbb{P}^*\times\mathbb{P} \to [0;1]$, $\quad \mu_3(x,y,z) = \dfrac{\mu_1(x,y)\,\mu_2(y,z)}{\Pr\{Y = y\}}$;

$\mu : \mathbb{P}\times\mathbb{P} \to [0;1]$, $\quad \mu(x,z) = \mu_3(\{x\}\times\mathbb{P}^*\times\{z\})$.

Then for $A_1 = \{(x,y,z) : x \preceq y\}$ and $A_2 = \{(x,y,z) : y \preceq z\}$ we have

$\mu(\{(x,z) : x \preceq z\}) = \mu_3(\{(x,y,z) : x \preceq z\}) \ge \mu_3(A_1 \cap A_2) \ge \mu_3(A_1) + \mu_3(A_2) - 1 = 1 - (q_1 + q_2)$.

□

Fact 2. If $(X_1,\dots,X_t)$ and $(Y_1,\dots,Y_t)$ are vectors of independent random variables and

(5) $X_i \preceq_{q_i} Y_i$, for all $1 \le i \le t$,

then

$(X_1,\dots,X_t) \preceq_q (Y_1,\dots,Y_t)$

and

t t 

i=l i=l 

where $q, q' \ge \prod_{i=1}^{t} q_i$.

Proof. For all $1 \le i \le t$, let $\mu_i : \mathbb{P}\times\mathbb{P} \to [0;1]$ be a probability measure associated with the
coupling existing by $X_i \preceq_{q_i} Y_i$. A simple calculation shows that $\mu : \mathbb{P}^t\times\mathbb{P}^t \to [0;1]$ such that

$\mu(x_1,\dots,x_t,y_1,\dots,y_t) = \prod_{i=1}^{t} \mu_i(x_i,y_i)$

implies the claim. □
3.2 Total variation distance

Let $X$ and $Y$ be random variables with values in the same countable set $\mathbb{P}$. We define
the total variation distance between $X$ and $Y$ by

$d_{TV}(X,Y) = \max_{A \subseteq \mathbb{P}} |\Pr\{X \in A\} - \Pr\{Y \in A\}| = \dfrac{1}{2}\sum_{x \in \mathbb{P}} |\Pr\{X = x\} - \Pr\{Y = x\}|$.

Now let $\mathbb{P} = \mathbb{G}$ be the set of hypergraphs (graphs) with a given vertex set. Let $G_1$ and $G_2$
be two random variables with values in $\mathbb{G}$. Since

$d_{TV}(G_1,G_2) = \dfrac{1}{2}\sum_{G \in \mathbb{G}} |\Pr\{G_1 = G\} - \Pr\{G_2 = G\}|$,

it is simple to construct a probability measure $\mu$ on $\mathbb{G}\times\mathbb{G}$ such that $\mu\{(G,G) : G \in \mathbb{G}\} =
1 - 2d_{TV}(G_1,G_2)$ with marginal distributions equal to the distributions of $G_1$ and $G_2$. This implies:

Fact 3.

$G_1 \preceq_q G_2$ and $G_2 \preceq_{q'} G_1$,

where $q, q' \ge 1 - 2d_{TV}(G_1,G_2)$.

In our considerations we will also use the following facts concerning total variation
distance, which are Facts 3 and 4 in [1].

Fact 4. Let $A$ and $A'$ be random variables with values in the same set. If there exist random
variables $B$ and $B'$ such that for all possible $b$ the distribution of $A$ under the condition $B = b$
and the distribution of $A'$ under the condition $B' = b$ are the same, then

$d_{TV}(A,A') \le 2d_{TV}(B,B')$.

Fact 5. Let $A$ and $A'$ be two random variables. If there exists a probability space on which
random variables $B$ and $B'$ are both defined and have the same probability distributions as $A$ and $A'$,
respectively, then

$d_{TV}(A,A') \le \Pr\{B \neq B'\}$.

We will also use a standard result (see for example [1] equation (1.23)).

Fact 6. Let $A$ be a random variable with binomial distribution $\mathrm{Bin}(n,p)$ and let $A'$ be a
random variable with Poisson distribution $\mathrm{Po}(np)$. Then

$d_{TV}(A,A') \le p$.
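
The bound of Fact 6 is easy to check numerically for concrete parameters. The following sketch computes the total variation distance between $\mathrm{Bin}(n,p)$ and $\mathrm{Po}(np)$ from the half-sum formula above; the function name `tv_binomial_poisson` and the truncation parameter `extra_terms` are assumptions of this illustration, and the code requires $0 < p < 1$.

```python
from math import exp


def tv_binomial_poisson(n, p, extra_terms=100):
    """d_TV(Bin(n, p), Po(n p)) via 1/2 * sum_k |Pr{Bin = k} - Pr{Po = k}|.

    The Poisson tail beyond n + extra_terms is ignored (negligible for
    moderate n * p); both mass functions are updated recursively.
    """
    lam = n * p
    b = (1.0 - p) ** n   # Pr{Bin = 0}
    q = exp(-lam)        # Pr{Po = 0}
    total = abs(b - q)
    for k in range(1, n + extra_terms + 1):
        b = b * (n - k + 1) / k * p / (1.0 - p) if k <= n else 0.0
        q = q * lam / k
        total += abs(b - q)
    return 0.5 * total
```

For instance, for $n = 1$ and $p = 1/2$ the distance equals $1/2 - e^{-1/2}/2$, which is indeed below $p$.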
3.3 Coupon collector model

We will define two auxiliary random variables, which are generalized versions of random
variables defined in [4]. Let $K \ge 2$ be a given constant integer, $M$ be any random variable
with values in $\mathbb{N}$, $\mathbf{n} = (n_2,\dots,n_K)$ be a vector of positive integers and $\mathbf{P} = (P_2,\dots,P_K)$
be a vector of nonnegative reals such that $\sum_{k=2}^{K} n_k P_k \le 1$. Assume now that we have
$\sum_{k=2}^{K} n_k$ coupons $\bigcup_{k=2}^{K}\{c_1^{(k)},\dots,c_{n_k}^{(k)}\}$ and one blank coupon $c_0$. We make $M$ independent
draws, with replacement, such that in each draw

$\Pr\{c_i^{(k)} \text{ is chosen}\} = P_k$, for $2 \le k \le K$, $1 \le i \le n_k$;

$\Pr\{c_0 \text{ is chosen}\} = 1 - \sum_{k=2}^{K} n_k P_k$.

In this scheme we define $R_i^{(k)}(M)$ to be a random variable denoting the number of times that
the coupon $c_i^{(k)}$ was chosen and

$X_i^{(k)}(M) = \begin{cases} 1, & \text{if } R_i^{(k)}(M) \ge 1; \\ 0, & \text{otherwise.} \end{cases}$

The first auxiliary random variable is

(6) $X(M) = X(\mathbf{n},\mathbf{P},M) = (X^{(2)}(M),\dots,X^{(K)}(M))$, where $X^{(k)}(M) = \sum_{i=1}^{n_k} X_i^{(k)}(M)$.

The second random variable is

(7) $Y = Y(\mathbf{n},\mathbf{P}') = (Y^{(2)},\dots,Y^{(K)})$,

where $\mathbf{P}' = (P_2',\dots,P_K')$ is a vector such that $P_k' \le 1$ for all $2 \le k \le K$ and $Y^{(k)}$, $2 \le k \le K$,
are independent random variables with binomial distribution $\mathrm{Bin}(n_k, P_k')$.

The simple observation stated below is a generalization of a part of the proof of
Claim 1 in [1] and may be shown by careful calculation.

Fact 7. Let $M$ be a random variable with Poisson distribution $\mathrm{Po}(\lambda)$. Then $R_i^{(k)}(M)$, $2 \le
k \le K$ and $1 \le i \le n_k$, are independent random variables with Poisson distribution $\mathrm{Po}(\lambda P_k)$.
Moreover $X^{(k)}(M)$, $2 \le k \le K$, are independent random variables with binomial distribution
$\mathrm{Bin}(n_k, 1 - \exp(-\lambda P_k))$. Therefore $X(M)$ and $Y$ have the same distribution for $P_k' = 1 -
\exp(-\lambda P_k)$.
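
The Poissonization effect behind Fact 7 can be observed in a small Monte Carlo experiment. The sketch below is an illustration under simplifying assumptions: it uses a single coupon type with $n_k$ coupons, each drawn with probability $P_k > 0$ (with $n_k P_k \le 1$), and a Poisson number of draws generated by Knuth's multiplication method; all function names are hypothetical. The empirical mean of the number of distinct coupons should be close to $n_k(1-\exp(-\lambda P_k))$.

```python
import math
import random


def poisson_sample(lam, rng):
    """Knuth's multiplication method; adequate for small lam."""
    limit, k, prod = math.exp(-lam), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def distinct_coupons(num_draws, n_k, P_k, rng):
    """Number of distinct non-blank coupons seen in num_draws draws."""
    seen = set()
    for _ in range(num_draws):
        idx = int(rng.random() / P_k)  # each index below n_k has probability P_k
        if idx < n_k:                  # otherwise the blank coupon was drawn
            seen.add(idx)
    return len(seen)


def mean_distinct_poissonized(lam, n_k, P_k, trials, seed=0):
    """Empirical mean of X^(k)(M) when M has distribution Po(lam)."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        total += distinct_coupons(poisson_sample(lam, rng), n_k, P_k, rng)
    return total / trials
```

The experiment only checks the mean; the independence of the indicators claimed in Fact 7 is not tested here.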

It is also simple to show the following fact.

Fact 8. Let $M$ and $M'$ be random variables with values in $\mathbb{N}$. If

$M \preceq_{1-o(1)} M'$,

then

$X(M) \preceq_{1-o(1)} X(M')$.
3.4 Chernoff bound

We will use the Chernoff bound (see Theorem 2.1 in [6]).

Lemma 1. Let $X$ be a random variable with binomial distribution and $\lambda = \mathbb{E}X$. Let $a > \lambda$,
then

$\Pr\{X \ge a\} \le \exp\left(-\lambda - a\ln\dfrac{a}{\lambda} + a\right)$.
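
The bound of Lemma 1 can be compared with the exact binomial tail for concrete parameters. The sketch below is only a numerical illustration, assuming the bound reads $\exp(-\lambda - a\ln(a/\lambda) + a)$; the helper names are hypothetical.

```python
from math import comb, exp, log


def binomial_upper_tail(n, p, a):
    """Exact Pr{X >= a} for X ~ Bin(n, p) and an integer a."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(a, n + 1))


def chernoff_bound(lam, a):
    """The bound exp(-lam - a * ln(a / lam) + a), valid for a > lam > 0."""
    return exp(-lam - a * log(a / lam) + a)
```

For $\mathrm{Bin}(100, 0.1)$ and $a = 20$ the bound is about $0.021$, while the exact tail is roughly ten times smaller.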
After careful calculation we obtain the following lemma.

Lemma 2. Let $t \ge 1$ be an integer and $X_n$ be a sequence of random variables with binomial
distribution such that $\mathbb{E}X_n = \lambda_n$. Let $\varepsilon > 0$ and $\omega(n)$ be any function tending to infinity. If

$a_n = \begin{cases} (t+\varepsilon)\ln n/(\ln\ln n - \ln\lambda_n), & \text{for } \lambda_n = o(\ln n); \\ \omega(n)\lambda_n, & \text{for } \lambda_n = \Theta(\ln n); \\ (1+\varepsilon)\lambda_n, & \text{for } \ln n = o(\lambda_n), \end{cases}$

then

$\Pr\{X_n \ge a_n\} = o(n^{-t})$.
Lemma 3. Let $X_n$ be a sequence of random variables with binomial distribution. Then

(9) $\Pr\{X_n \le \mathbb{E}X_n - t_n\} \le \exp\left(-\dfrac{t_n^2}{2\mathbb{E}X_n}\right)$, for $t_n \ge 0$;

$\Pr\{X_n \ge \mathbb{E}X_n + t_n\} \le \exp\left(-\dfrac{t_n^2}{2(\mathbb{E}X_n + t_n/3)}\right)$, for $t_n \ge 0$.

It is also possible to formulate a version of the Chernoff bound for random variables with
Poisson distribution.

Lemma 4. Let $X_n$ be a sequence of random variables with Poisson distribution $\mathrm{Po}(\lambda_n)$ and
$l > 0$ be any constant, then

(10) $\Pr\{X_n \le \mathbb{E}X_n - t_n\} \le \exp\left(-\dfrac{t_n^2}{2\mathbb{E}X_n}\right) + o\left(\dfrac{1}{n^l}\right)$, for $t_n \ge 0$;

$\Pr\{X_n \ge \mathbb{E}X_n + t_n\} \le \exp\left(-\dfrac{t_n^2}{2(\mathbb{E}X_n + t_n/3)}\right) + o\left(\dfrac{1}{n^l}\right)$, for $t_n \ge 0$.

Proof. It follows by (9) applied to a random variable with binomial distribution $\mathrm{Bin}(\lambda_n n^{l+1}, 1/n^{l+1})$,
the definition of the total variation distance and Fact 6. □

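4 Outline of the proof of theorems

Let $\hat{p}_-$ and $\hat{p}_+$ be such that

(11) $G(n,\hat{p}_-) \preceq_{1-o(1)} \mathcal{G}(n,m,p)$ and $\mathcal{G}(n,m,p) \preceq_{1-o(1)} G(n,\hat{p}_+)$.

Thus we may assume that there exists a probability space on which $G(n,\hat{p}_-)$ and $\mathcal{G}(n,m,p)$
($\mathcal{G}(n,m,p)$ and $G(n,\hat{p}_+)$) are defined and

$\Pr\{\mathcal{E}_-\} = 1 - o(1)$ ($\Pr\{\mathcal{E}_+\} = 1 - o(1)$),

for

$\mathcal{E}_- := \{G(n,\hat{p}_-) \subseteq \mathcal{G}(n,m,p)\}$ ($\mathcal{E}_+ := \{\mathcal{G}(n,m,p) \subseteq G(n,\hat{p}_+)\}$).

Let $a \in [0;1]$ and $\mathcal{A}$ be an increasing property. If

$\Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} \to a$ and $\Pr\{G(n,\hat{p}_+) \in \mathcal{A}\} \to a$,

then under the couplings implied by (11)

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \le \Pr\{\mathcal{G}(n,m,p) \in \mathcal{A} \mid \mathcal{E}_+\}\Pr\{\mathcal{E}_+\} + \Pr\{\mathcal{E}_+^c\}$
$\le \Pr\{G(n,\hat{p}_+) \in \mathcal{A} \mid \mathcal{E}_+\}\Pr\{\mathcal{E}_+\} + \Pr\{\mathcal{E}_+^c\}$
$= \Pr\{\{G(n,\hat{p}_+) \in \mathcal{A}\} \cap \mathcal{E}_+\} + \Pr\{\mathcal{E}_+^c\}$
$\le \Pr\{G(n,\hat{p}_+) \in \mathcal{A}\} + \Pr\{\mathcal{E}_+^c\}$
$= \Pr\{G(n,\hat{p}_+) \in \mathcal{A}\} + o(1) = a + o(1)$

and

$\Pr\{\mathcal{G}(n,m,p) \in \mathcal{A}\} \ge \Pr\{\mathcal{G}(n,m,p) \in \mathcal{A} \mid \mathcal{E}_-\}\Pr\{\mathcal{E}_-\}$
$\ge \Pr\{G(n,\hat{p}_-) \in \mathcal{A} \mid \mathcal{E}_-\}\Pr\{\mathcal{E}_-\}$
$= \Pr\{\{G(n,\hat{p}_-) \in \mathcal{A}\} \cap \mathcal{E}_-\}$
$= \Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} + \Pr\{\mathcal{E}_-\} - \Pr\{\{G(n,\hat{p}_-) \in \mathcal{A}\} \cup \mathcal{E}_-\}$
$\ge \Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} + \Pr\{\mathcal{E}_-\} - 1$
$= \Pr\{G(n,\hat{p}_-) \in \mathcal{A}\} + o(1) = a + o(1)$.

Analogous inequalities may be formulated for decreasing properties, or if we replace (11)
by

(12) $\mathcal{G}(n,m,p_-) \preceq_{1-o(1)} G(n,\hat{p}) \preceq_{1-o(1)} \mathcal{G}(n,m,p_+)$.

Therefore the main aim of the proof is to show that under the assumptions of the theorems
the couplings (11) and (12) exist. The existence of the couplings is implied by the following
lemma.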



Lemma 5. Let 



(13) a n (q) 



(Q 

3 In n/(— In nq 2 + In In n) 
3 In n/ In Inn 
3 In n/ (In In n — 3 In nq 2 ) 
Ld(n)n 3 q 6 , where uo(n) —> 
^cn 3 q 6 , where c > 1 



/or ng 2 = 0(n -1 / 2 ); 

for nq 2 = o(l) and o(nq 2 ) = n^ 1 ^ 2 ; 

for nq 2 = 0(1); 

/or ng 2 — > oo and ng 2 = o(v^lnn); 
/or ng 2 — > oo and nq 2 = Q(\/\nn) 
for nq 2 — > oo and o{nq 2 ) = v^lnn; 



10 



P+ = s 



Moreover let c 3 > 2 ■ 3/^3!, c 4 > ^15 ■ 3 ■ 4/^4!, c 5 > \/2 2 ■ 3 ■ 5 3 -4-5/ y/h\, q k = 
(1 - exp(-mp k (l - p) n ~ fe )) 1/ (2) ; f or k = 2,3,4,5 and 

$\hat{p}_- = q_2$;

$\hat{p}_+ = \begin{cases} q_2 + a_n(c_3q_3)c_3q_3, & \text{for } p = \Omega(n^{-1}m^{-1/3}) \text{ and } p = o(\min\{n^{-1}m^{-1/4},\ n^{-3/7}m^{-1/3}\}); \\ q_2 + \sum_{k=3}^{4} a_n(c_kq_k)c_kq_k, & \text{for } p = \Omega(n^{-1}m^{-1/4}) \text{ and } p = o(\min\{n^{-1}m^{-1/5},\ n^{-3/7}m^{-1/3},\ n^{-9/14}m^{-1/4}\}); \\ q_2 + \sum_{k=3}^{5} a_n(c_kq_k)c_kq_k, & \text{for } p = \Omega(n^{-1}m^{-1/5}) \text{ and } p = o(\min\{n^{-1}m^{-1/6},\ n^{-3/7}m^{-1/3},\ n^{-9/14}m^{-1/4},\ n^{-6/7}m^{-1/5}\}). \end{cases}$

Then

$G(n,\hat{p}_-) \preceq_{1-o(1)} \mathcal{G}(n,m,p)$ and $\mathcal{G}(n,m,p) \preceq_{1-o(1)} G(n,\hat{p}_+)$.

Now we will prove Theorems 2, 3 and 4 using Lemma 5.

Proof of Theorems 2, 3 and 4. It is easy to check that, under the assumptions of
Theorem 2(i) about $p$, we have $\hat{p}_+ = (1 + o(1))q_2$. Therefore, for some $\varepsilon' = \varepsilon'(n) \to 0$ we have
$\hat{p}_+ = (1 + \varepsilon')(1 - \exp(-mp^2(1-p)^{n-2}))$. Thus by Lemma 5

$G(n,q_2) \preceq_{1-o(1)} \mathcal{G}(n,m,p) \preceq_{1-o(1)} G(n,(1+\varepsilon')q_2)$,

which implies Theorem 2(i).

Analogously, in the proof of Theorem 2(ii) we use the fact that either $\hat{p}$ is such that we
may use Theorem 1, or for some $\varepsilon'(n) \to 0$ we get

$\mathcal{G}\left(n, m, \sqrt{\dfrac{-\ln(1-\hat{p})}{(1+\varepsilon')m}}\right) \preceq_{1-o(1)} G(n,\hat{p})$

and

$G(n,\hat{p}) \preceq_{1-o(1)} \mathcal{G}\left(n, m, \sqrt{\dfrac{-\ln(1-\hat{p})}{(1-\varepsilon')m}}\right)$.

This implies Theorem 2(ii).

The proofs of Theorems 3 and 4 are basically the same. We only have to calculate the
value of $a_n(c_kq_k)c_k$ and notice that $q_k \sim (mp^k)^{1/\binom{k}{2}}$ for $k = 3, 4$. □

Therefore we need to show Lemma 5 to complete the proof.

Outline of the proof of Lemma 5. In the statement of the lemma we have three different values of
$\hat{p}_+$. They correspond to three cases. In the first case, since $p = o(n^{-1}m^{-1/4})$, the hypergraph
$\mathcal{H}_4(n,m,p)$ is empty with high probability. In the second case $\mathcal{H}_5(n,m,p)$ is empty, and in
the third case $\mathcal{H}_6(n,m,p)$. We will prove Lemma 5 in all three cases at the same time.
The proof will differ only by the value of $K$, which is 3, 4 and 5 in the first, second and third
case, respectively.

Let $m = n^{\alpha}$ and $q_k = (1 - \exp(-mp^k(1-p)^{n-k}))^{1/\binom{k}{2}}$. We will prove that under the
assumptions of Lemma 5 there exists a sequence of couplings

(14) $G(n,q_2) \preceq_{1-o(1)}$

(15) $\mathcal{G}_2(n,m,p) \preceq_1 \mathcal{G}(n,m,p) \preceq_{1-o(1)} \bigcup_{k=2}^{K} \mathcal{G}_k(n,m,p)$

(16) $\preceq_{1-o(1)} \bigcup_{k=2}^{K} GH_k\left(n, q_k^{\binom{k}{2}}\right)$

(17) $\preceq_{1-o(1)} G(n,q_2) \cup \bigcup_{k=3}^{K} G(n, a_n(c_kq_k)c_kq_k)$

(18) $\preceq_1 G\left(n, q_2 + \sum_{k=3}^{K} a_n(c_kq_k)c_kq_k\right)$.

Here $H_k\left(n, q_k^{\binom{k}{2}}\right)$ are independent random hypergraphs, $c_3, c_4, c_5$ are constants given in
the statement of the lemma and $a_n(\cdot)$ is the function given in the statement of the lemma.

The left-hand side coupling of (15) is trivial and the coupling (18) is a standard one. The
right-hand side coupling in (15) follows by the fact that under the assumptions of Lemma 5

$\Pr\{\exists_{w \in W}\ |V(w)| > K\} = O(mn^{K+1}p^{K+1}) = o(1)$.

The proof of the existence of the remaining couplings will require more effort. The couplings
(14) and (16) follow by Lemma 6 and Fact 3.

Lemma 6. Let $K \ge 2$ be a constant integer and $p = o(1/n)$, then

$d_{TV}\left(\bigcup_{k=2}^{K} \mathcal{H}_k(n,m,p),\ \bigcup_{k=2}^{K} H_k(n, 1 - \exp(-mp^k(1-p)^{n-k}))\right) = o(1)$,

where $H_k(n, 1 - \exp(-mp^k(1-p)^{n-k}))$ are independent random hypergraphs.

Lemma 6 will be shown in Section 5.

The following lemmas imply the existence of the coupling (17).

Lemma 7. Let $c_3 > 2\cdot 3/\sqrt[3]{3!}$, $q = \Omega(n^{-1})$ and $q = o(n^{-3/7})$. Then

$GH_3(n,q^3) \preceq_{1-o(1)} G(n, a_n(c_3q)c_3q)$,

where $a_n(q)$ is defined as in (13).

Lemma 8. Let $c_4 > \sqrt[3]{15}\cdot 3\cdot 4/\sqrt[6]{4!}$, $q = \Omega(n^{-2/3})$ and $q = o(n^{-3/7})$. Then

$GH_4(n,q^6) \preceq_{1-o(1)} G(n, a_n(c_4q)c_4q)$,

where $a_n(q)$ is defined as in (13).

Lemma 9. Let c 5 > \/2 2 ■ 3 • 5 3 • 4 • 5/ g = ft(™~ 1/2 ) ? = o(™~ 3/7 )- 

Gi7 5 (n,g 10 ) ^i_ o( i) G (n, a„(c 5 g)c 5 g) , 
where a n (q) is defined as in . 

The above lemmas will be shown in Section 6. The coupling (17) is a consequence of the
above lemmas after substituting $q = q_k$ for $k = 3,\dots,K$. □

Notice that although the expected number of hyperedges in $GH_k\left(n, q^{\binom{k}{2}}\right)$ and cliques in
$G(n,q)$ is the same, the function $a_n$ is necessary. We will construct a coupling of two random
graph models, the existence of which contradicts the claim that there exists a constant $C$
such that for all $q$

$GH_3(n,q^3) \preceq_{1-o(1)} G(n,Cq)$.

Let $q = o(1)$. For any $e$, a 3-element subset of $V$, define $F_e$ to be the set of bijections
assigning to the numbers from the set $\{1,2,3\}$ the vertices of $e$ ($|F_e| = 6$). Now, to each $e$, a
3-element subset of $V$, and each function $f \in F_e$ we assign $f$ to $e$ independently of all other
functions and sets with probability

$r = 1 - (1-q^3)^{1/6} \sim \dfrac{q^3}{6}$.

Notice that if we add each edge $e$ to the edge set of the hypergraph with the vertex set $V$
whenever at least one function from $F_e$ is assigned to $e$, we will get a random variable
with the same distribution as $H_3(n,q^3)$. Moreover we may construct a random subgraph
$G_3$ of $GH_3(n,q^3)$ by adding an edge $(v_1,v_2)$, $v_1,v_2 \in V$, if and only if at least one 3-element
subset of $V$ containing $v_1$ and $v_2$ is assigned a function in which $v_1$ and $v_2$ are assigned 1
and 2 or 2 and 1. Notice that, by the independent choice of the functions from $F_e$,
each edge appears in $G_3$ independently with probability

$r' = 1 - (1-r)^{2(n-2)} \sim 2nr \sim \dfrac{nq^3}{3}$.

Therefore

$G(n,r') \preceq_1 GH_3(n,q^3)$

and in the lemmas there should be $a_n = \Omega(nq^2)$.
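
The asymptotics used in this construction can be checked numerically. The following minimal sketch evaluates $r = 1-(1-q^3)^{1/6}$ and $r' = 1-(1-r)^{2(n-2)}$ in a numerically stable way; the function name `assignment_probabilities` is an assumption made for this illustration, and the input must satisfy $0 < q < 1$.

```python
from math import expm1, log1p


def assignment_probabilities(n, q):
    """Return (r, r_prime) for the counterexample construction.

    r       = 1 - (1 - q^3)^(1/6)      (probability of assigning one bijection)
    r_prime = 1 - (1 - r)^(2 (n - 2))  (edge probability in the subgraph G_3)
    """
    r = -expm1(log1p(-q ** 3) / 6)
    r_prime = -expm1(2 * (n - 2) * log1p(-r))
    return r, r_prime
```

For $n = 10^6$ and $q = 10^{-3}$ the value of $r'$ is within a fraction of a percent of $nq^3/3 = (nq^2/3)\,q$, which shows why a constant multiple of $q$ cannot suffice when $nq^2 \to \infty$.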
5 Proof of Lemma 6

Let, for all $2 \le k \le K$,

$\hat{p}_k = p^k(1-p)^{n-k}$, $\quad n_k = \binom{n}{k}$, $\quad P_k = \dfrac{\hat{p}_k}{P}$ and $P_k' = 1 - \exp(-m\hat{p}_k)$,

$M$ have binomial distribution $\mathrm{Bin}(m,P)$ ($P = \sum_{k=2}^{K} \hat{p}_k n_k$), $X(M)$ be defined as in (6) and
$Y$ be defined as in (7). Then for all $2 \le k \le K$

$|E(\mathcal{H}_k(n,m,p))| = X^{(k)}(M)$ and $|E(H_k(n, 1 - \exp(-m\hat{p}_k)))| = Y^{(k)}$.

Moreover for any two hypergraphs $H$ and $H'$ such that for all $k \ge 2$ the number of edges of
cardinality $k$ in $H$ and $H'$ is the same, we have

$\Pr\left\{\bigcup_{k=2}^{K} \mathcal{H}_k(n,m,p) = H\right\} = \Pr\left\{\bigcup_{k=2}^{K} \mathcal{H}_k(n,m,p) = H'\right\}$

and

$\Pr\left\{\bigcup_{k=2}^{K} H_k(n, 1 - \exp(-m\hat{p}_k)) = H\right\} = \Pr\left\{\bigcup_{k=2}^{K} H_k(n, 1 - \exp(-m\hat{p}_k)) = H'\right\}$.

Therefore, by Fact 4 the following lemma implies Lemma 6.

Lemma 10. Let $K \ge 2$ be a constant integer. Let $p = o(1/n)$, $M$ be a random variable
with binomial distribution $\mathrm{Bin}\left(m, \sum_{k=2}^{K}\binom{n}{k}\hat{p}_k\right)$, $\hat{p}_k = p^k(1-p)^{n-k}$, $X(M)$ be defined as in
(6) for $n_k = \binom{n}{k}$ and $P_k = \hat{p}_k/\left(\sum_{k=2}^{K}\hat{p}_k n_k\right)$. Let moreover $Y$ be defined as in (7) for $P_k' =
1 - \exp(-m\hat{p}_k)$. Then

$d_{TV}(X(M),Y) = o(1)$.

The proof of the above lemma uses the main idea of the proof of Claim 1 in [1]. However, a
slight modification of the choice of $M$ and $M'$ enables us to extend the result to $\alpha < 4$.

Proof. We replace the binomial random variable $M$ with the Poisson random variable $M'$
with the same expected value $m\sum_{k=2}^{K}\binom{n}{k}\hat{p}_k$. By Fact 7 the random variables $X_i^{(k)}(M')$ are
independent and

$\Pr\{X_i^{(k)}(M') = 1\} = 1 - \exp(-m\hat{p}_k)$.

Therefore in $X(M') = (X^{(2)}(M'),\dots,X^{(K)}(M'))$, the random variables $X^{(2)}(M'),\dots,X^{(K)}(M')$
are independent with binomial distributions $\mathrm{Bin}(n_2, 1-\exp(-m\hat{p}_2)),\dots,\mathrm{Bin}(n_K, 1-\exp(-m\hat{p}_K))$,
respectively. By the definition of $Y$ and Facts 4 and 6 we have

$d_{TV}(Y, X(M)) = d_{TV}(X(M'), X(M)) \le 2d_{TV}(M', M) \le 2\sum_{k=2}^{K}\binom{n}{k}\hat{p}_k = O\left(\sum_{k=2}^{K}(np)^k\right) = o(1)$.

□
6 Proof of Lemmas 7, 8 and 9

In fact, we will show versions of the lemmas for random $k$-partite hypergraphs
and graphs. The following fact shows that we may reduce the problem to the $k$-partite case.

First let us introduce additional notation. Let $X_1,\dots,X_k$ be disjoint $n$-element sets and
$r \in [0;1]$. We define $H^{(k)}(n,r)$ to be the hypergraph with the vertex set $\bigcup_{i=1}^{k} X_i$ and an edge
set being a random subset of $\mathcal{E} := \{(x_1,\dots,x_k) : \forall_{1\le i\le k}\ x_i \in X_i\}$ such that each element
of $\mathcal{E}$ is added to $E(H^{(k)}(n,r))$ independently with probability $r$. Let moreover $G^{(k)}(n,r)$
be the random $k$-partite graph with $k$-partition $(X_1,\dots,X_k)$ and each edge appearing independently with
probability $r$. It is simple to prove the following fact.

Fact 9. Let a n = If 

(19) $GH^{(k)}\left(n, r^{\binom{k}{2}}\right) \preceq_{1-o(1)} G^{(k)}(n, a_n r)$,

then

$GH_k\left(n, 1 - \left(1 - r^{\binom{k}{2}}\right)^{k!}\right) \preceq_{1-o(1)} G\left(n, 1 - (1 - a_n r)^{k(k-1)}\right)$

and if $a_n r = o(1)$, then for any constant $c > k(k-1)$

$GH_k\left(n, k!\, r^{\binom{k}{2}}\right) \preceq_{1-o(1)} G(n, c\, a_n r)$.

Proof. Let $X_i = \{x_1^{(i)},\dots,x_n^{(i)}\}$, for $1 \le i \le k$, and $V = \{v_1,\dots,v_n\}$. For a given instance of
$H^{(k)}\left(n, r^{\binom{k}{2}}\right)$ (or $G^{(k)}(n, a_n r)$) we may get an instance of a hypergraph $H_k\left(n, 1 - \left(1 - r^{\binom{k}{2}}\right)^{k!}\right)$
(or a graph $G\left(n, 1 - (1 - a_n r)^{k(k-1)}\right)$) with the vertex set $V$ by merging, for all $1 \le j \le n$, all
the vertices $x_j^{(i)}$, $1 \le i \le k$, into $v_j$ and deleting the edges with fewer than $k$ (or 2) vertices. □

In the remaining part of the section we will prove the following lemmas.

Lemma 11. Let $q = \Omega(n^{-1})$ and $q = o(n^{-1/3})$. Then

$GH^{(3)}(n,q^3) \preceq_{1-o(1)} G^{(3)}(n, a_n(q)q)$,

where $a_n(q)$ is defined as in (13).

The above lemma is a generalization of Theorem 1.7 from [8], where it was shown in the
case $(\ln n/n^2)^{1/3} \ll q = o(n^{-3/5})$ and $a_n = 17$. Moreover we will prove the analogue of the
lemma for the case of 4-partite and 5-partite hypergraphs.

Lemma 12. Let $q = \Omega(n^{-2/3})$, $c_4' > \sqrt[3]{15}$ and $q = o(n^{-2/5})$. Then

$GH^{(4)}(n,q^6) \preceq_{1-o(1)} G^{(4)}(n, a_n(c_4'q)c_4'q)$,

where $a_n(q)$ is defined as in (13).



Lemma 13. Let q = VL( n - 1 / 2 ), c' 5 > y/2 2 ■ 3 • 5 3 and q = o(n~ 2 / 5 ). 

GH® (n,q 10 ) 4i-o(i) G (5) (n,a n (c' 5 q)c' 5 q) . 
where a n (q) is defined as in ffT3"j) . 
We will prove Lemma 11 in detail. The proofs of the remaining lemmas are analogous,
therefore we will only sketch them.

Proof of Lemma 11. First we will divide the hypergraph $H^{(3)}(n,q^3)$ into $n$ independent,
edge-disjoint hypergraphs $H(x)$, $x \in X_3$, such that

$H^{(3)}(n,q^3) = \bigcup_{x \in X_3} H(x)$.

For any $x \in X_3$, $H(x)$ will be a hypergraph with the vertex set $\{x\} \cup X_1 \cup X_2$ and an edge
set consisting of those edges from $E(H^{(3)}(n,q^3))$ which contain $x$.

Next, for each $x$ separately, we will compare $GH(x)$ with an auxiliary graph $T(x)$. By
definition, $T(x)$ will be a graph with the same vertex set as $H(x)$ and with an edge set
constructed by the following procedure. First we add each edge $(x,y)$, $y \in X_1 \cup X_2$, independently
with probability $Cq$ to the edge set, where

(20) $C = \begin{cases} c, \text{ where } c > 5, & \text{for } nq^2 = o(1); \\ \omega(n), \text{ where } \omega(n) \to \infty, & \text{for } nq^2 = \Theta(1); \\ cnq^2, \text{ where } c > 1, & \text{for } nq^2 \to \infty. \end{cases}$

(We assume that $\omega(n)$ tends slowly to infinity and $c$ is close to 5 and 1, respectively.)
Then independently with probability $q$ we add to the edge set each edge $(x_1,x_2) \in X_1^* \times X_2^*$,
where, for each $1 \le i \le 2$, $X_i^*$ is the set of vertices from $X_i$ connected by an edge with $x$.
We will prove that

(21) $GH(x) \preceq_{1-o(1/n)} T(x)$.

By the definition of $T(x)$, (21) is equivalent to

(22) $H^*(x) \preceq_{1-o(1/n)} T^*(x)$,

where $H^*(x)$ is a graph with the vertex set $X_1 \cup X_2$ and the edge set $\{(x_1,x_2) : (x_1,x_2,x) \in
E(H(x))\}$ and $T^*(x)$ is the subgraph of $T(x)$ induced on $X_1 \cup X_2$.
Then we will show that

(23) $\bigcup_{x \in X_3} T(x) \preceq_{1-o(1)} G^{(3)}(n, a_n(q)q)$.



To get fl23l) we will show that 
(24) |J T*(x)^ o{l) H^(n,a n (q)q) 

and (n, a n (q)q) is independent of the choice of X*. This by definition of a n (q), T(x) and 
T*(x) is equivalent to (1231). 

Therefore the main parts of the proof will be to show fl22l) and ([2] 



Proof of 

We will divide the proof into four cases:

1. $q = O(\ln n/n)$,

2. $\ln n/n \ll q$ and $q = O((n^{-2}\ln n)^{1/3})$,

3. $(n^{-2}\ln n)^{1/3} \ll q$ and $q = o(n^{-1/2})$,

4. $q = \Omega(n^{-1/2})$ and $q = o(n^{-1/3})$.

CASE 1

In this case we will use the fact that with probability $1 - o(1/n)$ the graph $H^*(x)$ consists
of at most one edge. Namely, the probability that the graph $H^*(x)$ has more than one edge
is at most




Moreover, for large n, 

PT{3 xieXl>X2eX2 ( Xl ,x 2 ) E E{T*{x))} > Pr {(^i^2) G E{T*{x))} 

£ Pr{(x 1 ,x 2 ),(x / 1 ,x' 2 )G J E;(r*(x))} 

xi,ir^eA'i,a;2,^2 e '^' 2 

= n 2 (Cg) 2 g - Q V<?)V - 2nQ (C ff )V = 
= CVg 3 (1 - 0(n 2 q 3 + nq 2 )) > n 2 q 3 > 

This gives an obvious coupling 

-o(l/n) T*(x). 

CASES 2, 3 and 4 

In the three latter cases we will use the fact that the number of vertices in $X_i^*$ is
sharply concentrated around its expected value. Let $r \in [0; 1]$. We denote by $H_*^{(2)}(C'nq, r)$ a
2-partite graph with 2-partition $(X_1, X_2)$ and with an edge set constructed by the following
procedure. Let

$C' = \begin{cases} 5, & \text{for } nq^2 = o(1); \\ \omega'(n), & \text{for } nq^2 = \Theta(1); \\ c'nq^2, \text{ where } c' > 1, & \text{for } nq^2 \to \infty, \end{cases}$

be such that $C/C' > 1$ is a constant. First, for each $1 \le i \le 2$ independently, we choose
$X_i'$ uniformly at random from all $C'nq$-element subsets of $X_i$, and then we add each edge
$(x_1, x_2) \in X_1' \times X_2'$ to the edge set independently with probability $r$. By the Chernoff bound (10),

$H_*^{(2)}(C'nq, q) \preceq_{1-o(1/n)} T^*(x)$.
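The two-step procedure above can be illustrated with a minimal sketch. It is not taken from the paper: the function name `sample_h_star` and the parameter `m` (standing in for $C'nq$, assumed here to be an integer) are hypothetical, introduced only for illustration.

```python
import random

def sample_h_star(x1, x2, m, r, rng):
    """Sketch of the procedure: pick an m-element subset on each side
    uniformly at random, then keep every pair between the two picked
    subsets independently with probability r.
    Here m plays the role of C'nq (an assumption: treated as an integer)."""
    x1_sel = set(rng.sample(sorted(x1), m))  # uniform m-element subset of X_1
    x2_sel = set(rng.sample(sorted(x2), m))  # uniform m-element subset of X_2
    edges = {(a, b) for a in x1_sel for b in x2_sel if rng.random() < r}
    return x1_sel, x2_sel, edges
```

With `r = 1.0` every one of the `m * m` pairs is kept, and with `r = 0.0` the edge set is empty, which gives a quick sanity check of the sampler.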
Therefore, since in $H^*(x)$ each edge appears independently, in order to show (22) we will
show that in all three cases

$H^{(2)}(n, q^3) \preceq_{1-o(1/n)} H_*^{(2)}(C'nq, q)$.

CASE 2 

In this case we will use the fact that $H^{(2)}(n, q^3)$ (i.e. also $H^*(x)$) with high probability
does not contain many edges except the maximum matching. First we will show auxiliary
lemmas.

Lemma 14. Let $r = o(1/(C'nq))$, $\ln n = o(nq)$ and let $N_2(r)$ be the random variable denoting
the size of the maximum matching in $H_*^{(2)}(C'nq, r)$. Then

(25) $N_2'(r) \preceq_{1-o(1/n)} N_2(r)$,

where $N_2'(r)$ has binomial distribution $\mathrm{Bin}(C'nq, s_2(r))$ and

$s_2(r) \sim C'nqr$.

Proof of Lemma 14. Let $H$ be a hypergraph chosen according to the probability distribution
of $H_*^{(2)}(C'nq, r)$. Define $H'$ to be the hypergraph with a vertex set $X_1$ and an edge set
$\{(x_1) : x_1 \in X_1 \text{ and } \exists_{x_2 \in X_2}\ (x_1, x_2) \in E(H)\}$. Notice that $H'$ is chosen according to the probability
distribution of $H_*(C'nq, 1 - (1-r)^{C'nq})$ ($H_*(C'nq, 1 - (1-r)^{C'nq})$ is a hypergraph with
a vertex set $X_1$ and the edge set constructed by first choosing $X_1'$ uniformly at random from
all $C'nq$-element subsets of $X_1$ and then adding to the edge set each $x_1 \in X_1'$ independently
with probability $1 - (1-r)^{C'nq}$). Let $H''$ be the subhypergraph of $H$ such that for each
edge $(x_1) \in E(H')$ we pick uniformly at random an edge from $E(H)$ containing $x_1$ and
add it to the edge set of $H''$. Notice that the maximum matching in $H$ is at least of the
size of the set of non-isolated vertices in $X_2$ in $H''$. Moreover, the edge set of $H''$ may be
alternatively constructed in the following way (i.e. this construction will give us the same
probability distribution). First we pick an integer according to the binomial distribution
$\mathrm{Bin}(C'nq, 1 - (1-r)^{C'nq})$; then, given the value of the picked integer, we pick a subset $X_1''$
uniformly at random from all subsets of $X_1$ of this cardinality. Independently we choose $X_2'$
uniformly at random from all $C'nq$-element subsets of $X_2$. Then to each vertex $x_1 \in X_1''$,
to create an edge, we add one vertex chosen uniformly at random from the set $X_2'$. For all
$x_1 \in X_1''$ the choices of the second vertex are independent with repetition. Therefore by the
above construction, (10) and Fact M

$X(M) \preceq_{1-o(1/n)} X(C'nq) \preceq N_2$,

where $X(M)$ and $X(C'nq)$ are defined as in for $K = 2$, $n_2 = C'nq$, $P_2 = (1 - (1-r)^{C'nq})/(C'nq)$
and $M$ with Poisson distribution $\mathrm{Po}(C'nq - \sqrt{C'nq \ln n})$. Thus by Fact [7]
$X(M)$ has binomial distribution $\mathrm{Bin}(C'nq, s_2(r))$, where

$s_2(r) = 1 - \exp\left(-(C'nq - \sqrt{C'nq \ln n})(1 - (1-r)^{C'nq})/(C'nq)\right) =$
$= 1 - \exp(-(1 + o(1))C'nqr) \sim$
$\sim C'nqr$.

□ 
Using the above lemma we will show the existence of a coupling between the random
variable $M_2$, denoting the size of the edge set of $H^{(2)}(n, q^3)$, and $N_2$.

Lemma 15. Let $C' = 5$, let $M_2$ have binomial distribution $\mathrm{Bin}(n^2, q^3)$ and let $N_2$ be the size of
the maximum matching in $H_*^{(2)}(C'nq, q)$. Then

$M_2 \preceq_{1-o(1/n)} N_2$.

Proof. By the previous lemma and Fact 1 it is sufficient to show

(26) $M_2 \preceq_{1-o(1/n)} N_2'$,

where $N_2'$ has binomial distribution $\mathrm{Bin}(C'nq, s_2(q))$ for some $s_2(q) \sim C'nq^2$.
Notice that

$M_2 = \sum_{i=1}^{nq} \xi_i$, where $\xi_i$ are independent with distribution $\mathrm{Bin}(n/q, q^3)$;

$N_2' = \sum_{i=1}^{nq} \zeta_i$, where $\zeta_i$ are independent with distribution $\mathrm{Bin}(C', s_2(q))$.

Since $s_2(q) \sim C'nq^2$, for large $n$ we have

$\forall_{1 \le l \le 4}\ \Pr\{\xi_i = l\} \le (nq^2)^l \le \binom{C'}{l} s_2^l (1 - s_2)^{C'-l} = \Pr\{\zeta_i = l\}$

and

$\Pr\{\xi_i > 4\} \le \binom{n/q}{5} q^{15} \le (nq^2)^5 = \frac{1}{n^2 q}\, n^7 q^{11} = o\left(\frac{1}{n^2 q}\right)$.

Therefore, for all $1 \le i \le nq$ it is simple to construct a probability measure on $\mathbb{N} \times \mathbb{N}$
whose existence implies

$\xi_i \preceq_{1-o(1/(n^2 q))} \zeta_i$.

This by Fact [2] implies (26). □

Since we will compare the sizes of the maximum matchings in $H^{(2)}(n, q^3)$ and $H_*^{(2)}(C'nq, q)$,
we will introduce additional notation. Let $\mathcal{G}$ be a set of 2-partite graphs with 2-partition
$(X_1, X_2)$. We define

$\mathcal{M}(l)$ - the subset of $\mathcal{G}$ containing all graphs with a maximum matching of cardinality $l$;
$\mathcal{M}_1(l)$ - the subset of $\mathcal{M}(l)$ containing all graphs with the maximum degree 1;
$\mathcal{M}_2(l)$ - the subset of $\mathcal{M}(l)$ containing all graphs with the maximum degree 2 and
exactly one vertex of degree 2

and

$\mathcal{M}_1 = \bigcup_{l=0}^{n} \mathcal{M}_1(l)$, $\mathcal{M}_2 = \bigcup_{l=0}^{n} \mathcal{M}_2(l)$.
For $q = o(n^{-2/3})$

$\Pr\{H^{(2)}(n, q^3) \notin \mathcal{M}_1\} \le 2n\binom{n}{2} q^6 = O(n^3 q^6) = o(1/n)$

and for $q = O((n^{-2} \ln n)^{1/3})$

$\Pr\{H^{(2)}(n, q^3) \notin \mathcal{M}_1 \cup \mathcal{M}_2\} \le$




Now let 

$\mu : \mathbb{N} \times \mathbb{N} \to [0, 1]$

be the probability measure associated with the coupling of $M_2$ and $N_2$ from Lemma 15. Let
$\mathcal{G}$ be the set of all 2-partite graphs with 2-partition $(X_1, X_2)$. Starting with the probability
measure $\mu$ we will construct a coupling which implies, for large $n$,

$H^{(2)}(n, q^3) \preceq_{1-o(1/n)} H_*^{(2)}(C'nq, q)$.

We introduce additional notation. Let $H^{(2)}(\mathcal{M}_1)$ be the random graph constructed in
the following way. First we sample $H$ according to the probability distribution of $H^{(2)}(n, q^3)$,
and if $H$ is not in $\mathcal{M}_1$, then we replace it by a graph chosen uniformly at random from
$\mathcal{M}_1(|E(H)|)$. Moreover, let $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)$ be the random graph constructed by sampling
$H$ according to the probability distribution of $H^{(2)}(n, q^3)$ and replacing it by a graph chosen
uniformly at random from $\mathcal{M}_1(|E(H)|)$ if it is not contained in $\mathcal{M}_1 \cup \mathcal{M}_2$. The sizes
of the edge sets of $H^{(2)}(\mathcal{M}_1)$ and $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)$ are random variables $M_2(\mathcal{M}_1)$ and
$M_2(\mathcal{M}_1 \cup \mathcal{M}_2)$, respectively. Obviously $M_2(\mathcal{M}_1)$ and $M_2(\mathcal{M}_1 \cup \mathcal{M}_2)$ have the same distribution as
$M_2$. For any event $A$, denote by $H^{(2)}(\mathcal{M}_1)^{[A]}$, $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)^{[A]}$ and $H_*^{(2)}(C'nq, q)^{[A]}$ the
graphs $H^{(2)}(\mathcal{M}_1)$, $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)$ and $H_*^{(2)}(C'nq, q)$ under condition $A$.

In the case $q = o(n^{-2/3})$ the construction of the coupling is simple. By the above
calculation

$H^{(2)}(n, q^3) \preceq_{1-o(1/n)} H^{(2)}(\mathcal{M}_1)$.

Therefore we will only show

$H^{(2)}(\mathcal{M}_1) \preceq_{1-o(1/n)} H_*^{(2)}(C'nq, q)$.

Let $(l_1, l_2) \in \mathbb{N} \times \mathbb{N}$ be chosen according to the probability measure $\mu$. If $l_1 > l_2$, then we
sample $H^{(2)}(\mathcal{M}_1)^{[M_2=l_1]}$ and $H_*^{(2)}(C'nq, q)^{[N_2=l_2]}$ independently. If $l_1 \le l_2$, then first we
sample an instance of $H_*^{(2)}(C'nq, q)^{[N_2=l_2]}$ and then choose its subgraph uniformly at random
from all its subgraphs contained in $\mathcal{M}_1(l_2)$. Then, from the chosen subgraph, we delete $l_2 - l_1$
edges chosen uniformly at random. In this way we get an edge set of $H^{(2)}(\mathcal{M}_1)^{[M_2=l_1]}$.
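The final thinning step of this coupling (deleting $l_2 - l_1$ edges uniformly at random from a matching with $l_2$ edges) can be sketched as follows. The helper name `thin_matching` is hypothetical, and the sketch covers only the deletion step, not the sampling of the larger graph.

```python
import random

def thin_matching(matching, l1, rng):
    """Keep l1 edges of the given matching, chosen uniformly at random.
    This is the same as deleting len(matching) - l1 uniformly chosen edges,
    so the result is always a subgraph of the input matching."""
    if not 0 <= l1 <= len(matching):
        raise ValueError("l1 must lie between 0 and the matching size")
    return set(rng.sample(sorted(matching), l1))
```

Because the output is a subset of the input, the smaller graph is contained in the larger one by construction, which is the property the coupling needs when $l_1 \le l_2$.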
Now we will construct a coupling in the case $q = \Omega(n^{-2/3})$ and $q = O((n^{-2} \ln n)^{1/3})$. Let

$P_1(l) = \Pr\{H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)^{[M_2=l]} \in \mathcal{M}_1\}$;
$Q_1(l) = \Pr\{H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)^{[M_2=l]} \notin \mathcal{M}_1\}$;

$P_2(l) = \Pr\{H_*^{(2)}(C'nq, q)^{[N_2=l]} \in \mathcal{M}_1\}$;
$Q_2(l) = \Pr\{H_*^{(2)}(C'nq, q)^{[N_2=l]} \notin \mathcal{M}_1\}$.

We will show that

$H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2) \preceq_{1-o(1/n)} H_*^{(2)}(C'nq, q)$.

Let $(l_1, l_2) \in \mathbb{N} \times \mathbb{N}$ be chosen according to the probability measure $\mu$. If $l_1 > l_2$ or $l_2 >
\omega(n) \ln n$ (where $\omega(n)$ is a sequence tending slowly to infinity), then we construct a pair of
graphs from $\mathcal{G}$ by sampling independently $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)^{[M_2=l_1]}$ and $H_*^{(2)}(C'nq, q)^{[N_2=l_2]}$.
If $1 \le l_1 \le l_2 \le \omega(n) \ln n$, then we sample $H$, the second graph in the pair, according to the
probability distribution of $H_*^{(2)}(C'nq, q)^{[N_2=l_2]}$. If $H \in \mathcal{M}_1$, then we choose the first graph
uniformly at random from all subgraphs of $H$ contained in $\mathcal{M}_1(l_1)$. If $H \notin \mathcal{M}_1$, then with
probability $(P_1(l_1) - P_2(l_2))/Q_2(l_2)$ we choose the first graph uniformly at random from all
subgraphs of $H$ contained in $\mathcal{M}_1(l_1)$, and with probability $Q_1(l_1)/Q_2(l_2)$ we choose the first
graph uniformly at random from all subgraphs of $H$ contained in $\mathcal{M}_2(l_1)$.
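A short arithmetic check may help here. Assuming $P_1 + Q_1 = 1$, $P_2 + Q_2 = 1$, $P_1 \ge P_2$ and $P_2 < 1$, the two branch probabilities used when $H \notin \mathcal{M}_1$ sum to one, and the first graph lands in $\mathcal{M}_1$ with total probability $P_1$. The function below is a hypothetical illustration with exact rational arithmetic, not part of the original argument.

```python
from fractions import Fraction

def first_graph_in_m1(p1, p2):
    """Return the total probability that the first graph lies in M_1
    under the mixing rule described above (requires p2 <= p1 and p2 < 1)."""
    q1, q2 = 1 - p1, 1 - p2
    to_m1 = (p1 - p2) / q2   # branch probability: H not in M_1, first graph in M_1
    to_m2 = q1 / q2          # branch probability: H not in M_1, first graph in M_2
    assert to_m1 + to_m2 == 1  # the two branches form a probability distribution
    # H in M_1 (probability p2) always yields a first graph in M_1.
    return p2 + q2 * to_m1
```

For instance, `first_graph_in_m1(Fraction(9, 10), Fraction(3, 4))` evaluates to `Fraction(9, 10)`, that is, the value passed as `p1`.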

According to this construction the first graph is chosen according to the probability
distribution of $H^{(2)}(\mathcal{M}_1 \cup \mathcal{M}_2)$ and the second according to the probability distribution of
$H_*^{(2)}(C'nq, q)$. Moreover

$\mu(\{(l_1, l_2) : l_1 > l_2\}) = o(1/n)$.

In addition, the size of the maximum matching (i.e. $N_2$) is at most the number of edges
of $H_*^{(2)}(C'nq, q)$, which has binomial distribution with expected value $(C'n)^2 q^3 = O(\ln n)$.
Thus by the Chernoff bound (10)

$\mu(\{(l_1, l_2) : l_2 > \omega(n) \ln n\}) = o(1/n)$.

Therefore this is a desired coupling and it is well defined for large $n$ if $P_1(l_1) \ge P_2(l_2)$ for
large $n$ and $l_1 \le l_2$. Calculations show that for a given $l \le \omega(n) \ln n$ and $\omega(n)$ tending slowly
to infinity
and



Q 2 (l) = Pi{H^ (C'nq,qf 2 - l] <£ M,} > 

Pr{#* (2) (C'nq, q) [N2=l] £ Mi} - Pi{H {2) (C'nq, q) [N2=l] ^jU M 2 } 



since 



> 



1 - Pr{#i 2) (C'nq, q) [N2 ~ l] $ Mi U M 2 } 
Pr{#i 2) {C'nq,q) [N2=l] G M 2 } 



Pr{Hi 2) (C'nq, q) [N2=l] G M x } + Pr{#i 2) (C'nq, q) [N2=l] G M 2 } 

Pr{ffj 2) (C'nq,q) G M 2 (l)} = 

Pr{^i 2) (C'nq, q) G M x (l)} + Pr{#i 2) (C'nq, q) G M 2 (l)} ~~ 
Q (nq 2 l) = Q(n- 1/3 l), 



Pr{H^ (C'nq,q)eM 2 (l)}-- 
C'nq\ fC'nq\ (I + 1 



1+1 



2 



i + ij \ i j v 2 y v ; V 1 - ?/ 

Cng^ 2 (gng - /) (Z + 1)Z / g_ V _g_ ^(Cnq) 2 _ 
Z J (l + l) ' 2 l-g l q) 

= Pr{H^ (C'nq, q) G Mi(l)}(\ + o(l))^^ 

Hence $Q_1(l_1) = o(Q_2(l_2))$ uniformly over all $1 \le l_1 \le l_2 \le \omega(n) \ln n$ and $\omega(n)$ such that
$\omega(n) \ln n = o(nq)$.



CASE 3 and 4 



In this case we will use the fact that the number of edges in $H^{(2)}(n, q^3)$ and $H_*^{(2)}(C'nq, q)$
is sharply concentrated around its expected value. Therefore in order to compare $H^{(2)}(n, q^3)$
and $H_*^{(2)}(C'nq, q)$ we will compare auxiliary random graphs. In fact we will construct a
coupling of the degree sequences of some multigraphs of which $H^{(2)}(n, q^3)$ and $H_*^{(2)}(C'nq, q)$
are underlying graphs.

By Fact [7] the graphs $H^{(2)}(n, q^3)$ and $H_*^{(2)}(C'nq, q)$ are underlying graphs of multigraphs
$H^{**}(x)$ and $T^{**}(x)$ with the Poisson distribution of the number of edges with expected
value $-n^2 \ln(1 - q^3)$ and $-(C'nq)^2 \ln(1 - q)$, respectively, and an edge set constructed by
independently choosing one by one with repetition edges from the edge set of the complete
2-partite graph with 2-partition $(X_1, X_2)$ and $(X_1', X_2')$, respectively. By the Chernoff bound (10)

$H^{**}(x) \preceq_{1-o(1/n)} H^{***}(x)$ and $T^{***}(x) \preceq_{1-o(1/n)} T^{**}(x)$,

where $H^{***}(x)$ is the multigraph with $C_1 n^2 q^3$ ($C_1 > 1$ is a constant) edges and $T^{***}(x)$ is
the multigraph with $(C'')^2 n^2 q^3$ edges (where $C'/C'' > 1$ is a constant). Notice that choosing
one edge is equivalent to choosing its 2 vertices independently from each set of 2-partition.
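The expected values $-n^2 \ln(1 - q^3)$ and $-(C'nq)^2 \ln(1 - q)$ quoted above match a standard Poissonization identity: if the multiplicity of a fixed pair is Poisson with mean $-\ln(1 - p)$, the pair is present in the underlying graph with probability exactly $p$. The sketch below checks this numerically; the function names are hypothetical and the snippet illustrates only this identity, not Fact 7 itself.

```python
import math

def poisson_mean_for_edge(p):
    """Per-pair Poisson mean whose 'at least one copy' probability equals p."""
    return -math.log1p(-p)          # equals -ln(1 - p)

def prob_at_least_one(lam):
    """Pr{Po(lam) >= 1} = 1 - exp(-lam)."""
    return -math.expm1(-lam)

def total_mean(num_pairs, p):
    """Mean of the total number of multigraph edges over num_pairs pairs."""
    return num_pairs * poisson_mean_for_edge(p)
```

With `num_pairs = n**2` and `p = q**3` the function `total_mean` returns $-n^2 \ln(1 - q^3)$, the value used for $H^{**}(x)$.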
Therefore we may independently choose the degree sequence in each set $X_i$ ($X_i'$) and then
create the multigraph with a given degree sequence. Thus, by Fact 2, to prove that

$H^{***}(x) \preceq_{1-o(1/n)} T^{***}(x)$,

we have to prove for each $X_i$, $1 \le i \le 2$, that

$(D_1^{(1)}, \dots, D_n^{(1)}) \preceq_{1-o(1/n)} (D_1^{(5)}, \dots, D_n^{(5)})$,

where $D_j^{(1)}$ is the random variable denoting the degree of the $j$-th vertex in $X_i$ in $H^{***}(x)$
and $D_j^{(5)}$ is the random variable denoting the degree of the $j$-th vertex in $X_i$ in $T^{***}(x)$.

We introduce auxiliary urn models. Assume that we have $n$ urns. Let $\mathbb{D}^{(i)} =
(D_1^{(i)}, \dots, D_n^{(i)})$ be a random vector in which $D_j^{(i)}$ stands for the number of balls in the $j$-th urn
in the $i$-th model. Let $1 < C_1 < C_2$, $C'''/C'''' > 1$, $C''/C''' > 1$ and $C'/C'' > 1$ be constants
such that $C_2 < C''''$.

• In the 1-st model we throw $C_1 n^2 q^3$ balls one by one independently, with repetition, to
the urn chosen uniformly at random from $n$ urns.

• In the 2-nd model the number of thrown balls has Poisson distribution $\mathrm{Po}(C_2 n^2 q^3)$,
i.e. by Fact [7] $D_j^{(2)}$ has Poisson distribution $\mathrm{Po}(C_2 n q^3)$ (for $K = 2$, $P_2 = 1/n$, $n_2 = n$).

• In the 3-rd model $D_j^{(3)} = D_j \cdot D_j'$, where $D_j$ is a Bernoulli random variable with
probability of success $C''''q$ and $D_j'$ has Poisson distribution $\mathrm{Po}(C''''nq^2)$.

• In the 4-th model first we select $C'''nq$ urns from the set of all urns and the number
of balls thrown to the selected urns has Poisson distribution $\mathrm{Po}((C''')^2 n^2 q^3)$, i.e. for
the urns not selected $D_j^{(4)} = 0$ and for the selected urns $D_j^{(4)}$ has Poisson distribution
$\mathrm{Po}(C'''nq^2)$ (by Fact [7] for $K = 2$, $P_2 = 1/(C'''nq)$, $n_2 = C'''nq$).

• In the 5-th model first we select $C''nq$ urns and we throw $(C'')^2 n^2 q^3$ balls one by one
independently to the urn chosen uniformly at random from the set of selected urns.

By the Chernoff bound

$\mathbb{D}^{(1)} \preceq_{1-o(1/n)} \mathbb{D}^{(2)}$ and $\mathbb{D}^{(3)} \preceq_{1-o(1/n)} \mathbb{D}^{(4)} \preceq_{1-o(1/n)} \mathbb{D}^{(5)}$.

Moreover, by Fact 2, if for large $n$

(27) $D_j^{(2)} \preceq_{1-o(1/n^2)} D_j \cdot D_j'$,
then for large $n$

(28) $\mathbb{D}^{(2)} \preceq_{1-o(1/n)} \mathbb{D}^{(3)}$.
The constants may be chosen such that

$C'''' = \begin{cases} 4, & \text{for } nq^2 = o(1); \\ \omega''''(n), & \text{for } nq^2 = \Theta(1); \\ c''''nq^2, & \text{for } nq^2 \to \infty, \end{cases}$

where $c'''' > 1$ and $\omega''''(n)$ is a function tending slowly to infinity.
For large n 

$\Pr\{D_j^{(2)} \ge 1\} = 1 - \exp(-C_2 n q^3) \le$

$\le C''''q\,(1 - \exp(-C''''nq^2)) =$
$= \Pr\{D_j^{(3)} \ge 1\}$.
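The inequality above can be checked numerically for concrete values. The sketch below is only an illustration under assumed sample parameters ($n = 10^6$, $q = 5 \cdot 10^{-4}$, $C_2 = 2$, $C'''' = 4$, so that $nq^2 = 0.25$); the function names are hypothetical and a single numerical instance does not replace the asymptotic argument.

```python
import math

def tail_model2(n, q, c2):
    """Pr{D_j^(2) >= 1} for D_j^(2) ~ Po(c2 * n * q^3)."""
    return -math.expm1(-c2 * n * q**3)

def tail_model3(n, q, c4):
    """Pr{D_j^(3) >= 1}: a Bernoulli(c4 * q) success times the event
    that a Po(c4 * n * q^2) variable is at least 1."""
    return c4 * q * (-math.expm1(-c4 * n * q**2))

n, q = 10**6, 5e-4
ok = tail_model2(n, q, c2=2.0) <= tail_model3(n, q, c4=4.0)
```

For these sample values `ok` is `True`: the urn in the 2-nd model is non-empty with smaller probability than the urn in the 3-rd model.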

Moreover, for $t \ge 2$, $nq^2 = o(1)$ and large $n$

$\Pr\{D_j^{(2)} \ge t\} \sim \frac{(C_2 n q^3)^t}{t!} = o\left(C''''q\,\frac{(C''''nq^2)^t}{t!}\right) = o\left(\Pr\{D_j^{(3)} \ge t\}\right)$.

This implies (27) for $nq^2 = o(1)$.

Let now $nq^2 = \Omega(1)$. By the Chernoff bound, if we estimate the number of urns with at least
one ball and compare it to the number of balls, we get that with probability $1 - o(1/n)$
the number of urns with at least 2 balls in the 3-rd model is $o(n^2 q^3)$ and $\Omega(n^2 q^3)$ in the 2-nd
model. Therefore, since urns with at least 2 balls are uniformly distributed, the coupling is
easy to construct.



Proof of (24)



Let $C = C(q)$ be defined as in (BUI). Define $X_n(x_1, x_2) = |\{x \in X_3 : x_1 \in X_1^*(x), x_2 \in
X_2^*(x)\}|$. $X_n(x_1, x_2)$ has binomial distribution $\mathrm{Bin}(n, (Cq)^2)$. Then

$EX_n = n(Cq)^2 = \begin{cases} c\,nq^2, \text{ where } c > 25, & \text{for } nq^2 = o(1); \\ \omega^2(n)\,nq^2, & \text{for } nq^2 = \Theta(1); \\ c\,n^3q^6, \text{ where } c > 1, & \text{for } nq^2 \to \infty. \end{cases}$



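Therefore by Lemma [3]

$\Pr\{\exists_{(x_1, x_2)}\ X_n(x_1, x_2) > a_n'(q)\} \le n^2 \Pr\{X_n(x_1, x_2) > a_n'(q)\} = o(1)$,

where

$a_n'(q) = \begin{cases} 3\ln n/(\ln\ln n - \ln(nq^2)), & \text{for } nq^2 = o(1); \\ 3\ln n/\ln\ln n, & \text{for } nq^2 = \Theta(1); \\ 3\ln n/(\ln\ln n - \ln(n^3q^6)), & \text{for } nq^2 \to \infty \text{ and } nq^2 = o(\sqrt[3]{\ln n}); \\ \omega_1(n)\,n^3q^6, \text{ where } \omega_1(n) \to \infty, & \text{for } nq^2 \to \infty \text{ and } nq^2 = \Theta(\sqrt[3]{\ln n}); \\ c\,n^3q^6, \text{ where } c > 1, & \text{for } nq^2 \to \infty \text{ and } \sqrt[3]{\ln n} = o(nq^2) \end{cases}$

(since $\omega(n)$ may tend to infinity arbitrarily slowly).
Thus

$\bigcup_{x \in X_3} T^*(x) \preceq_{1-o(1)} H^{(2)}(n, a_n(q)q)$.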



□ 
The proof of the remaining two lemmas is analogous. However, we only need to consider
the analogues of CASE 2 and 3, which shortens the proof. 

Proof of Lemmas 12 and 13. Let $k = 4$ or $k = 5$. Similarly to the previous proof, we will
divide the hypergraph $H^{(k)}(n, q^{\binom{k}{2}})$ into $n$ edge-disjoint hypergraphs $H(x)$, $x \in X_k$, such
that

$H^{(k)}(n, q^{\binom{k}{2}}) = \bigcup_{x \in X_k} H(x)$.

For any $x \in X_k$, $H(x)$ will be a hypergraph with the vertex set $\{x\} \cup \bigcup_{i=1}^{k-1} X_i$ and an edge
set consisting of those edges from $E(H^{(k)}(n, q^{\binom{k}{2}}))$ which contain $x$.

$T(x)$ will be an auxiliary hypergraph, which has the same vertex set as $H(x)$ and an
edge set constructed by the following procedure. First we add each edge $(x, y)$, $y \in \bigcup_{i=1}^{k-1} X_i$,
independently with probability $Cq$ ($C > 5$) to the edge set, and then independently with
probability $q^{\binom{k-1}{2}}$ we add to the edge set each edge $(x_1, \dots, x_{k-1}) \in X_1^* \times \dots \times X_{k-1}^*$, where,
for each $1 \le i \le k - 1$, $X_i^*$ is the set of vertices connected by an edge with $x$. Therefore we
have to prove that

(29) $GH(x) \preceq_{1-o(1/n)} GT(x)$
and that

(30) $\bigcup_{x \in X_k} GT(x) \preceq_{1-o(1)} G^{(k)}(n, a_n(c_k'q)c_k'q)$.

To get (30) we will show that if $T^*(x)$ is the subhypergraph of $T(x)$ induced on $\bigcup_{i=1}^{k-1} X_i$,
then for $c > 5(k - 1)$

(31) $\bigcup_{x \in X_k} T^*(x) \preceq_{1-o(1)} H^{(k-1)}(n, cq^{\binom{k-1}{2}})$,

where $H^{(k-1)}(n, cq^{\binom{k-1}{2}})$ is independent of the choice of $X_i^*$, and then we will use the lemma
for $(k - 1)$ to arrive at

$\bigcup_{x \in X_k} GT^*(x) \preceq_{1-o(1)} G^{(k-1)}(n, a_n(c_k'q)c_k'q)$,

which by definition of $T(x)$ and $T^*(x)$ implies (30).

The proof of (29) in CASE 2 and 3 is similar to the proof of (22). We will only sketch
the proof of (31).

Define $X_n = X_n(x_1, \dots, x_{k-1}) = |\{x \in X_k : x_1 \in X_1^*(x), \dots, x_{k-1} \in X_{k-1}^*(x)\}|$. It has
binomial distribution $\mathrm{Bin}(n, (Cq)^{k-1})$ and for large $n$

$EX_n = C^3 nq^3 \le n^{-1/5}$ and $EX_n = C^4 nq^4 \le n^{-3/5}$

for $k = 4$ and $k = 5$, respectively. Therefore, since

$\frac{\ln n}{\ln\ln n - \ln n^{-1/5}} \le 5$ and $\frac{\ln n}{\ln\ln n - \ln n^{-3/5}} \le \frac{5}{3}$,
by Lemma [3], for any constants $c_4'' > 15$ and $c_5'' > 20/3$,

$\Pr\{\exists_{(x_1, \dots, x_{k-1})}\ X_n(x_1, \dots, x_{k-1}) > c_k''\} \le n^{k-1} \Pr\{X_n(x_1, \dots, x_{k-1}) > c_k''\} = o(1)$.
Thus in the case $k = 4$, for any constant $c_4 > \sqrt[3]{15}$, we have

$\bigcup_{x \in X_4} T^*(x) \preceq_{1-o(1)} H^{(3)}(n, (c_4 q)^3)$.

Thus, by Lemma 11,

$\bigcup_{x \in X_4} T^*(x) \preceq_{1-o(1)} G^{(3)}(n, a_n(c_4'q)c_4'q)$.

Finally, since $a_n(c_4'q) > 5$,

$GH^{(4)}(n, q^6) \preceq_{1-o(1)} G^{(4)}(n, a_n(c_4'q)c_4'q)$.
Analogously, for $k = 5$ and $c > \sqrt[6]{20/3}$

$\bigcup_{x \in X_5} T^*(x) \preceq_{1-o(1)} H^{(4)}(n, (cq)^6)$.

Therefore by Lemma 12, for $c_5' > \sqrt[6]{20/3}\,\sqrt[3]{15} = \sqrt[6]{2^2 \cdot 3 \cdot 5^3}$,

$\bigcup_{x \in X_5} T^*(x) \preceq_{1-o(1)} G^{(4)}(n, a_n(c_5'q)c_5'q)$,
which implies the thesis. □ 